An adaptive band-partitioning spectral entropy based speech detection in realistic noisy environments
نویسنده
چکیده
Generally, the feature parameters used for speech detection are highly sensitive to the environment. The performance of speech detection is severely degraded under realistic noisy environments since the characteristics of a speech signal cannot be fully expressed by those feature parameters. As a result, this study seeks the acoustic fingerprints of speech spectrogram as a robust feature to distinguish a speech from a non-speech, especially in adverse environments, and the fact that the frequency energies of difference types of noise are concentrated on different frequency bands [12], an ABSE (Adaptive Band-partitioning Spectral Entropy)-based speech detection algorithm is proposed to detect speech signals in adverse environments. Additionally, the ABSE-based algorithm is demonstrated to work in real-time with minimal processing delay. Experimental results indicate that the ABSE parameter is very effective for several SNRs (Signal to Noise Ratios) and various noise conditions. Furthermore, the proposed ABSE-based algorithm outperforms other approaches and is reliable in a real car.
منابع مشابه
Robust entropy-based endpoint detection for speech recognition in noisy environments
This paper presents an entropy-based algorithm for accurate and robust endpoint detection for speech recognition under noisy environments. Instead of using the conventional energy-based features, the spectral entropy is developed to identify the speech segments accurately. Experimental results show that this algorithm outperforms the energy-based algorithms in both detection accuracy and recogn...
متن کاملAdaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments
This study examines the difficult task of Speech Activity Detection (SAD) in two hostile environments: AM push-to-talk air traffic control and international telephone conversations with very low SNRs. Due to the poor performance of traditional energy-based SAD, two novel approaches to SAD were developed that specifically target spectral characteristics that typify speech, rather than trying to ...
متن کاملNoise Estimation based on Entropy without using VAD for Speech Enhancement
A practical speech enhancement system consists of two major components, the estimation of noise power spectrum, and the estimation of speech.In single channel speech enhancement systems, most algorithms require an estimation of average noise spectrum since a secondary channel is not available. This requires a reliable speech/silence detector. Thus the speech/silence detection can be a determini...
متن کاملSpectral Entropy as Speech Features for Speech Recognition
This paper presents an investigation of spectral entropy features, used for voice activity detection, in the context of speech recognition. The entropy is a measure of disorganization and it can be used to measure the peakiness of a distribution. We compute the entropy features from the short-time Fourier transform spectrum, normalized as a PMF. The concept of entropy shows that the voiced regi...
متن کاملPerceptual wavelet adaptive denoising of speech
This paper introduces a novel speech enhancement system based on a wavelet denoising framework. In this system, the noisy speech is first preprocessed using a generalized spectral subtraction method to initially lower the noise level with negligible speech distortion. A perceptual wavelet transform is then used to decompose the resulting speech signal into critical bands. Threshold estimation i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004